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METHOD FOR GENERATING RECOMBINANT POLYNUCLEOTIDES 

FIELD OF THE INVENTION 

This invention is directed to a novel method for producing 
recombinant polynucleotides in vitro. 

BACKGROUND OF THE INVENTION 

The process of recombination is of incredible interest because of its 
ability to generate novel sequences possessing improved or desired functions. 
Naturally, this is an indiscriminate process. This has changed, however, with the 
design of procedures capable of achieving these same goals in a more deliberate and 
crafted manner. Procedures mimicking natural recombination events are now 
exploited to specifically generate and select for proteins or sequences particular to 
certain needs. Many methods exist whereby one of skill in the art can carry out these 
processes. 

The difficulty is that many of these methods require a detailed 
understanding of the sequence and structure/function relationship of a particular 
protein of interest and, thus, are very time-consuming and information-intensive; see, 
e.g., Shao and Arnold, 1996 Curr. Opin. Struct. Biol. 6:513-518. Other methods are 
available that circumvent this time and energy expense, however, they have an 
expense of their own in, for example, requiring numerous cycles of selection and 
mutation in the pursuit of a sequence possessing the desired beneficial mutations {see, 
e.g., Kuchner and Arnold, 1997 Trends in Biotech. 15:523-530) or repeated PCR 
cycles in generating recombinant sequences that might be of interest (see, e.g., U.S. 
Patent No. 5,605,793). 

Heteroduplex molecules have been employed in the art as alternative 
substrates to the typical perfectly complementary (homoduplexed) double-stranded 
nucleic acid molecule and have been found to effect more directed recombination 
efforts, avoiding the very information- and/or labor-intensive methods available in the 
art; see, e.g., Lu et al, 1983 Proc. Natl. Acad. Sci. USA 80:4639-4643. Heteroduplex 
molecules in these processes can be created by a number of methods. Most often, the 
heteroduplex molecules are created by heating and annealing of the initial (parental) 
homoduplex nucleic acid molecule substrates. 
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' Repair of these heteroduplex intermediate molecules has been 
employed in methods specifically designed to generate recombinant, novel sequences; 
seee.g., WO 99/29902; Volkov etal, 1999 Nucleic Acids Res. 27(18):el8 i-vi. The 
procedure disclosed in WO 99/29902, specifically, involves preparing a heteroduplex 
5 molecule in vitro from double-stranded nucleic acid molecules differing in sequence 
composition (referred to herein as variant homoduplex parental sequences or variant 
homoduplex molecules). This heteroduplex molecule is then introduced into a 
bacterial cell (E. coli) repair system where repair of the mismatched regions is 
effected resulting in a unique double stranded nucleic acid molecule (or homoduplex) 

10 different from the double-stranded nucleic acid starting material. 

Repair of heteroduplex molecules in the E. coli bacterial cell is known in the 
art to be carried out in vivo through the mut HLS system; see e.g., Lu efal, supra; WO 
99/29902; Volkov et a/., 1999 Nucleic Acids Res. 27(18):i-vi. Mut HLS repair in vitro has 
been described in the art; see, e.g., Fang et a!., 1997 /. Biol Chem. 272(36):22714-22720. 

15 It would be desirable to identify alternative systems capable of generating 

novel polynucleotides via the repair of mismatched heteroduplex molecules, particularly one 
that repairs heteroduplex molecules with a greater consistency and efficiency than the 
bacterial Mut HLS system. More desirable is such a process capable of effecting this repair 
in vitro such that the repair mechanism can be isolated, more closely analyzed, and 

20 optimized. 

SUMMARY OF THE INVENTION 

The present invention relates to a novel method for producing 
recombinant polynucleotides in vitro wherein heteroduplex molecules are contacted 
25 with a mixture comprising a heteroduplex repair system consisting essentially of a 
polymerase, preferably, in the presence of a nuclease. The results achieved using this 
process are superior to that achieved by previous in vivo recombination efforts 
utilizing specific DNA repair systems. 

30 BRIEF DESCRIPTION OF THE DRAWINGS 

FIGURE 1 shows the nucleic acid sequences of wild-type lacZa (SEQ 
ID NO:5) and two constructed mutants of lacZatNl and M2; SEQ ID Nos: 1 and 2, 
respectively), aligned. Both Ml and M2 mutants contain four stop codons, two in- 
frame and two out of the reading frame. The Ml variant contains a 27 base mutation 

2 
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region starting at base 50 (from the ATG start) on the Lac Zoc gene, including a 3 base 
deletion and 16 mismatched bases. The M2 variant contains a 22 base mutation 
region with 9 mismatched bases and a 3 base insertion starting at base 156 on the 
gene. The distance between the last in-frame stop codon on the Ml variant and the 
5 first in-frame stop codon on the M2 variant is 81 bases. 

FIGURE 2 illustrates a gel showing the DNA after the nick translation 
reaction (both uncut and cut with restriction enzymes before cloning) and the control 
after having been digested with restriction enzymes. From left, the lanes are as 
follows: 5\xL 1KB Ladder; 2^L digested ssDNA heteroduplex control, 2|jlL digested 
10 dsDNA heteroduplex control; 2|xL digested ssDNA heteroduplex after nick 

translation; 2|nL digested dsDNA heteroduplex after nick translation with ligase; 2\xL 
digested dsDNA heteroduplex after nick translation; 3.5(xL Lac Za insert DNA; 
3.5}iL ssDNA heteroduplex after nick translation; 3.5jjL dsDNA heteroduplex after 

ntrV tranclatinn with licrao^* onH *JnT HcFlMA hAtar/vliinlAv ofter r»tob- tronelotirm 

15 HGURE 3 shows the nucleic acid sequences of wild-type lacZa(SEQ 

ID NO:5) and two additional constructed mutants of lacZa(M3 and M4; SEQ ID 
Nos: 3 and 4, respectively), aligned. Mutants M3 and M4 contain four stop codons, 
two in-frame and two out of the reading frame. Neither mutant has any insertions or 
deletions of bases relative to the wildtype Lac Za gene. M3 contains a 14 base 

20 mutation region with 12 mismatched bases relative to wildtype while M4 has a 16 
base mutation region and 8 mismatched bases. There are also 81 bases between the 
end of the last stop codon in M3 and the beginning of the first stop codon in M4. 

FIGURE 4 illustrates, diagrammatically, the formation of heteroduplex 
molecules with mutants Ml and M2. 

25 FIGURE 5 shows a protocol for nicking the heteroduplex molecules of 

Figure 4. 

HGURE 6 shows the activity, diagrammatically, of DNA Pol I on the 
nicked molecule of Figure 5. 

FIGURE 7 shows the expected results using the blue/white screening 
30 assay discussed below. 

FIGURES 8 A and 8B illustrate a general reaction carried out using the 
disclosed methods. 
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DETAILED DESCRIPTION OF THE INVENTION 

As used throughout the specification and claims, the following 
definitions apply: 

"recombinant polynucleotides", "recombinant sequences", etc., refers 
5 to sequences, derived from recombination efforts disclosed herein, that differ in 
sequence composition from the original parental sequences (the variant homologous 
parental sequences, see below). 

"recombination" refers to any process whereby a chimeric sequence is 
generated from the parental sequences or sequences derived therefrom. 
10 "novel" or "chimeric" with respect to a particular polynucleotide refers 

to a sequence that differs in sequence composition from the variant homologous 
parental sequences (defined below) employed in a particular reaction. 

"variant homologous parental sequences" refers to sequences that 
differ in sequence composition yet possess a degree of homology sufficient to allow 
15 hybridization of complementary DNA strands of the parental sequences. One of 

ordinary skill in the art is fully aware that hybridization conditions will vary based on 
the level of homology; see,' e.g., Stemmer et al, 1994 Proc. Natl. Acad. Set USA 
91:10747-10751, wherein conditions were modified to accommodate a very low 
effective annealing temperature in order to generate chimeras from a human and a 
20 murine TL-l$ gene possessing areas of sequence identity of on average only 4.1 bases 
long. 

"conditions which promote heteroduplex formation" refers to any 
conditions that allow for heteroduplex formation between complementary strands of 
the parental homologous sequences. 
25 "mixture" refers to any combination of reagents (e.g., enzymes, 

buffers, etc.) or materials employed within the instant invention. 

"heteroduplex repair system" refers to the enzymes or enzyme used to 
effect recombination of the heteroduplexes by means of resolving mismatches 
between the two variant strands of the heteroduplex molecule in order to bring about 
30 the formation of recombinant polynucleotides. 

"nuclease" refers to any protein capable of hydrolyzing a peptide bond 
within a nucleic acid sequence. 

"polymerase" refers to any protein capable of catalyzing the addition of 
a nucleotide to the 3 1 end of a nucleic acid molecule chain. 
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"Pol I" refers to DNA polymerase L 

"heteroduplex" refers to a double-stranded nucleic acid molecule wherein the 
composite strands are not 100% complementary. Preferably, they are substantially 
complementary (greater than 50%, more preferably, greater than 70%, even more preferably, 
5 greater than 80% and most preferably, greater than 90%). It is this molecule that is subject to • 
the polymerase preferably with the nuclease. It is important that this be distinguished from 
that employed in Stemmer (U.S. Patent No. 5,605,793). Therein the key intermediate was 
not a heteroduplex molecule as above described but, rather, a molecule wherein a particular 
strand of said molecule is derived from numerous fragments of the parental sequences. 
10 "thermal melting point" or "TnT refers to the temperature at which 

approximately 50% ±2% of a set of complementary probes hybridize to a target 
sequence at equilibrium under defined ionic strength, pH and nucleic acid 
concentration. 

15 In accordance with the instant invention, Applicants have discovered a novel 

process wherein recombinant polynucleotides can be produced in vitro using a process 
employing a polymerase and, preferably, a nuclease. These enzymes have been found to 
resolve mismatches between variant heteroduplex strands and, in the process, generate novel 
sequences, i.e., sequences differing in sequence composition from the variant homologous 

20 parental sequences used in a particular reaction. 

The polymerase, preferably in combination with the nuclease, is the main 
effector of the heteroduplex repair system. The repair system, however, can also comprise 
any other enzymes found to contribute to the heteroduplex repair process, so long as the 
actual recombination effects (i.e., generation of recombinant polynucleotides via resolution of 

25 mismatched nucleotides within the heteroduplex) is a direct result of the activity of the 
polymerase (in the case of a nicked heteroduplex molecule), or the polymerase and the 
nuclease. 

A means of producing recombinant polynucleotide sequences from 
heteroduplex DNA which specifically employs and relies on nick translation enzymes 
30 (particularly, DNasel, an endonuclease, and Pol I, a polymerase) or, alternatively, solely a 
polymerase (wherein the heteroduplex molecule has already been nicked) is new. The 
predominant method of use in the art for generating recombinant sequences from a 
heteroduplex has been a system employing the mutH, mutL and mutS enzymes of E. coli\ 
e.g., see WO 99/29902.. 
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The instant invention, therefore, relates to a method for producing 
recombinant polynucleotides, which comprises providing variant homologous parental ♦ 
sequences; incubating said sequences under conditions which promote heteroduplex 
formation; contacting said heteroduplexes with a mixture comprising a heteroduplex repair 
5 system consisting essentially of a nuclease (preferably, an endonuclease) and a polymerase; 
and identifying the recombinant polynucleotides produced. Alternatively, and additionally, 
the instant invention relates to a method for producing recombinant polynucleotides, which 
comprises providing variant homologous parental sequences; incubating said sequences 
under conditions which promote heteroduplex formation wherein the formed heteroduplexes 

10 are nicked; contacting said heteroduplexes with a mixture comprising a heteroduplex repair 
system consisting essentially of a polymerase; and identifying the recombinant 
polynucleotides produced. Additional methods provide a means of repairing mismatched 
nucleic acid molecules in vitro which comprises contacting the mismatched molecule with a 
mixture consisting essentially of a polymerase (wherein the mismatched molecule is nicked) 

15 or a polymerase and a nuclease. 

The essence of the invention is that the reaction of nick translation is 
exploited in vitro to resolve mismatches between heteroduplex DNA derived from 
variant homologous parental sequences. Nick translation is known in the art primarily 
as a means of removing RNA primers or, alternatively, for producing uniformly 

20 radioactive DNA of high specific activity, and is thereby noted for the preparation of 
sequence-specific probes, for genomic DNA blots and for RNA blots. The concept of 
using the nick translation machinery in the generation of recombinant sequences from 
mismatched heteroduplex molecules is novel. Applicants were the first to adopt this 
very specific reaction to a very useful purpose - that of generating unique, potentially 

25 more desirable, sequences; sequences which perhaps encode proteins of improved 
properties. 

Throughout the disclosure, it is to be noted that elements other than the 
nuclease and the polymerase (e.g., buffer materials, dNTPs, H20, etc.) can be added to the 

heteroduplex repair system to facilitate a specific reaction, hence the term mixture within the 
30 claim. This does not deny, however, that the repair mechanism is carried out as a direct 
result of the actions of the polymerase (in the event of a nicked heteroduplex) or the 
polymerase and the nuclease. Accordingly, the mixture must specifically comprise the 
heteroduplex repair system discussed above (mainly, the polymerase or polymerase/nuclease 
combination) but can add other reagents to impact the repair process in a desired manner. 

6 
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The disclosed method was utilized in efforts to shuffle genetic material 
amongst two non-functional mutants of Lac Za Briefly, blue/white screening was conducted 
in the presence of X-Gal. Any recombination between the two mutant Lac Za genes which 
resulted in a wildtype LacZa was identified by the release of an indigo blue product. Release 
5 of the indigo blue product was dependent on cleavage of the X-Gal by P-galactosidase that is • 
encoded by wildtype Lac Za. Non-functional mutant colonies appear white when expressed 
in the presence of X-Gal. By contrast, a successful experiment (where both mutations are 
removed from the heteroduplex DN A) reverts effected colonies back to a blue wildtype 
phenotype in the presence of X-Gal. Reversion to the wildtype as tested herein is one manner 

10 in which to readily detect and report specific recombination events; see, e.g., Stemmer (U.S. 
Patent No. 5,605,793). 

The results achieved employing the instant process were ks follows: the nick 
translation reaction resulted in 5 blue colonies out of 524 colonies, for a 0.95% reversion 
frequency; whiie a heteroduplex control resulted in 7 biue colonies out of i,032 colonies, for 

15 a 0.54% reversion frequency (roughly half as efficient as the nick-translated DNA). Similar 
experiments were run, finding a 0.8% reversion frequency (16/1,911) for the nick-translation- 
treated heteroduplex molecules, and a 0.2% reversion frequency (2/1,01 1) for the 
heteroduplex control group (roughly 1/4 as efficient as the nick-translated version). The 
heteroduplex control employed in the examples was not subjected to nick translation 

20 processes but, simply, digested with restriction enzymes, ligated into a vector and 

transformed into E. coli cells, thus following the protocol of the nick-translated heteroduplex 
molecules all except for contact with the heteroduplex repair system (mainly, the polymerase 
or polymerase/nuclease combination). Thus, the mismatched heteroduplex control was 
subject only to cellular (JE. coli) repair processes and, therefore, epitomized that employed in 

25 Arnold et al , WO 99/29902. 

The recombinant polynucleotides of the instant invention are generated from 
heteroduplex molecules derived from variant homologous parental sequences. Variant 
homologous parental sequences of use in the instant invention are any sequences that differ in 
sequence composition (by at least one nucleotide) yet possess a degree of homology 

30 sufficient to allow hybridization of complementary sequences. Variant homologous parental 
sequences can be mutant forms or different alleles of the same sequence, they can encode the 
same protein but be present within different organisms or, alternatively, they can be of two or 
more different genes with a degree of homology sufficient for hybridization. The variant 
homologous parental sequences can further be present on vector DNA, examples of which 
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are plasmids, cosmids, BACs (bacterial artificial chromosomes) and YACs (yeast artificial 
chromosomes). Preferably, the variant homologous parental sequences are from 50 to 
150,000 basepairs, more preferably, 50 to 10, 000, and most preferably, from 300 to 2000 
basepairs. The sequences can also vary in length with respect to each other. Preferably, the 
5 sequences vary in length from about 50 to 5,000 basepairs. Most preferably, the sequences . 
vary in length from 0-2,000 basepairs. The true goal here is sufficient hybridization in 
heteroduplex form in order to allow for a polymerase to act, preferably, in concert with a 
nuclease, to generate unique DNA from the heteroduplex. 

As one of skill in the art will appreciate, hybridization conditions are 

10 generally sequence-dependent and vary according to the desired reaction. Preferably, 
hybridization conditions are determined based on the sequence and are roughly 25°C 
lower than the thermal melting point (Tm) for the sequence of interest at a defined 
ionic strength and pH; all of which is readily determined by one skilled in the art. 
One of ordinary skill in the art wiii further appreciate that the annealing temperature 

15 can be varied in order to accommodate heteroduplex formation between variant 
sequences; see, e.g., Stemmer et al, supra, wherein conditions were modified to 
accommodate a very low effective annealing temperature in order to generate 
chimeras from a human and a murine IL-ip gene possessing areas of sequence 
identity of on average only 4. 1 bases long. 

20 The variant homologous sequences can exist in double-stranded or single- 

stranded form. Furthermore, they can be DNA (e.g., PCR product, genomic or cDNA) or 
RNA or any analog thereof. In the event that RNA is used, the RNA is transcribed to cDNA 
prior to heteroduplex formation for annealing with other DNA or cDNA products. 

Most preferably, the sequences incorporated into the reaction possess a high 

25 degree of homology. Preferably, the sequences are at least 50% homologous, more 
preferably, at least 70% homologous and, most preferably, at least 90% homologous. 
Particularly preferred embodiments include variant homologous sequences with at least 99% 
sequence identity. Preferably, at least two sequences which differ in at least two basepair 
positions will be present in order to allow for the generation of a unique sequence from a 

30 recombination event. 

The instant invention was demonstrated with a system wherein the variant 
homologous parental sequences were derived from the Lac Zoc gene, particularly the mutants 
Ml, M2, M3 and M4 (SEQ ID Nos: 1-4, respectively) of Lac Za. This system is employed in 
the art {see U.S. Patent No. 5,605,793) as an efficient, reliable means to demonstrate the 
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viability of a particular gene shuffling method. As one of ordinary skill in the art will 
appreciate, though, this is not meant to impact its applicability to any of a number of "variant 
homologous sequences" as defined in the instant specification. It is easily understood that 
this system is amenable to use with any number of sequences that differ in sequence 
5 composition yet possess a degree of homology sufficient to allow hybridization of 
complementary DNA strands of different parental sequences. 

Variant homologous parental sequences in accordance with the instant 
invention can be generated in any number of ways or they can be natural variants. Some 
preferred methods of producing variants are error-prone PCR, cassette mutagenesis, UV 

10 mutagenesis, site-directed mutagenesis, chemical mutagenesis, and in vivo mutagenesis; all of 
which are known in the art. One of skill in the art can appreciate, though, that any means 
capable of producing variant homologous sequences as defined in the instant disclosure is 
suitable for use in the instant invention. 

Once the sequences are chosen and available, the sequences are placed 

15 under conditions in order to allow heteroduplex molecules to form from the variant 
parental sequences. A number of means exist in the art to generate heteroduplex 
DNA from homologous parental sequences; all of which can be used in the instant 
invention. The determination of conditions suitable for the heteroduplex formation 
for a particular set of sequences is, further, well within the realm of skill of one of 

20 ordinary skill in the art. 

Preferred embodiments generate heteroduplex DNA by the heating and 
annealing of variant homologous sequences. More preferably, two homologous PCR 
products (PCRed for amplification purposes) containing desired mutations in 
appropriate buffer are heated and annealed together. PCR, in this instance, is used for 

25 the purpose of amplifying the initial substrates, the variant homologous parental 

sequences. It is not a necessary step of the actual recombination process. The instant 
process does not mandate the use of primers, nor does it depend on at least three 
rounds of PCR as does the method of Stemmer (U.S. Patent No. 5,605,793). 

Another means of producing heteroduplex DNA involves amplifying 

30 single-stranded plasmid DNA by use of M13-derived vectors and helper phage. In 
this embodiment, two complimentary single-stranded plasmids containing two 
homologous genes amplified in this manner are joined to create heteroduplex DNA 
and the nick translation reaction is run using entire plasmid. 
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Another preferred method of producing heteroduplex DNA involves 
running separate single strand PCR reactions for each of the homologous genes. In 
this manner, a low homoduplex background is produced. One reaction is run with 
only the forward primer using a parental sequence PCR product as template and the 
5 other reaction is run with the reverse primer and a "variant homologous parental 
sequence" (see definitions) PCR template. The forward and reverse single-stranded 
PCR products containing the two homologous parental sequences are then joined to 
create heteroduplex DNA. This heteroduple/DNA is then subjected to a nick 
translation reaction. The resultant product is then cut and cloned into a suitable 

10 plasmid expression vector. 

In an alternate preferred embodiment, one mutant plasmid is digested 
with a unique restriction enzyme upstream of the gene (e.g., EcoRI) and the other 
mutant plasmid is digested with a different unique restriction enzyme downstream of 
the gene (e.g., EcoR0109I). The resulting linear fragments are combined, heated to 

15 94°C to denature the DNA strands and cooled back to room temperature. 

Heteroduplex DNA under these conditions circularizes and homoduplex DNA 
remains linear. Thereafter, heteroduplex DNA is easily distinguishable from 
homoduplex DNA and can be physically separated by agarose gel electrophoresis and 
cutting out the circular heteroduplex band from the agarose gel. This purified 

20 heteroduplex plasmid DNA may be subjected to a nick translation protocol and, after 
purification, directly transformed into a suitable expression host. 

Results achieved using this embodiment with M3 and M4 mutants, 
SEQ ID Nos: 3 and 4, respectively, were as follows: in XL-1 blue cells , the nick 
translation reaction resulted in 27 blue colonies out of 212 total colonies, for a 

25 12.74% transformation efficiency; while the heteroduplex control resulted in 18 blue 
colonies out of 364 total colonies, for a 4.95% transformation efficiency; in XL mutS 
cells , the nick translation reaction resulted in 55 blue colonies out of 385 total 
colonies, for a 14.29% transformation efficiency; while the heteroduplex control 
resulted in 38 blue colonies out of 510 total colonies, for a 7.45% transformation 

30 efficiency. 

The subsequently described steps, preferably, are carried out with a 
DNA fragment that has been purified, preferably, by gel electrophoresis, although the 
reaction can also be carried out on plasmid or phage vector DNA sequences. 
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Following heteroduplex formation with the variant homologous 
parental sequences, the heteroduplexes are contacted with a mixture comprising a 
heteroduplex repair system consisting essentially of a nuclease and a polymerase or, 
in the instance that a nicked heteroduplex is present, a polymerase. The reaction 
5 carried out by a nuclease and polymerase, particularly with DNasel and Pol I is akin 
to that of nick translation and is, thus, referred to as such throughout the specification. 

The enzymes behind this reaction, a polymerase and a nuclease, or 
alternatively, the enzyme (polymerase) when the resultant heteroduplex molecule is 
nicked, are noted as crucial to this process, specifically by carrying out the 

10 transcription necessary to resolve mismatches between the variant heteroduplex 
molecule strands and form a unique double-stranded homoduplex molecule. 
Accordingly, the combination of these molecules (the polymerase and nuclease) or, in 
the alternative, the polymerase (when the heteroduplex is nicked) is referred to as a 
heteroduplex repair system. 

15 While the term "heteroduplex repair system" was coined specifically in . 

recognition of the respective actions of a nuclease and a polymerase on heteroduplex 
molecules or, alternatively, the action of a polymerase on a nicked heteroduplex 
molecule, this term is specifically meant to include all other additions to these 
enzymes which facilitate mismatch repair but do not impact the essential purpose of 

20 the two enzymes (i.e., the polymerase and the nuclease) in this invention. In other 
words, the term "heteroduplex repair system" as used within the instant disclosure 
refers to any combination of proteins or enzymes wherein the polymerase and 
polymerase/nuclease combination are considered to be primarily responsible for the 
heteroduplex repair activity discussed above. This definition, therefore, speaks only 

25 to situations wherein another enzyme or protein is not considered an essential 
contributing factor to the observed activity. 

Polymerases of use are known in the art. Preferably, the polymerase is 
selected from the following: E. coli DNA polymerase, Klenow fragment; reverse 
transcriptase; T4 DNA polymerase; Native T7 DNA polymerase; chemically modified 

30 T7 DNA polymerase; genetically modified T7 DNA polymerase (A28); Pfu DNA 

polymerase; KlenTaq (Ab PeptidesTM) DNA polymerase; and Taq DNA polymerase. 
Mutants of DNA polymerases, for instance, mutants of DNA polymerase I are also 
useful in the instant invention. Accordingly, Pol A5 is also useful in the instant 
invention. 
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Most preferably, the enzyme is DNA Pol L With regard to the 
polymerase, the time of the reaction is important. Nucleotide replacement along the 
gene must be kept to a minimum so that read-through by the polymerase on one 
strand does not negate read-through by other polymerase molecules on the other 
5 strand. As one of skill in the art will appreciate, the reaction time can be adapted for 
optimal results and is related to the concentration of enzymes and reagents in the nick 
translation reaction. Most preferably, the nick translation reaction is run for a time 
between 5 minutes and 2 hours. 

DNase I, or any other nuclease (be it an endonuclease or an 

10 exonuclease) capable of hydrolyzing a double-stranded DNA molecule (e.g., a 

restriction enzyme), is the other crucial part of the nick translation process, where the 
molecule has not already been nicked. Preferably, the enzyme used is DNasel, more 
preferably DNasel in the presence of Mg2+, which randomly nicks duplex DNA. The 
amount of DNasel (or the specific nuclease utilized) can be optimized to the specific 

15 application. For example, nick translations of short DNA fragments (<500 basepairs) 
may require greater concentrations of DNasel to ensure all molecules are nicked at 
least once. Preferably, a stock solution is prepared of approximately l-3|xL 
DNaseI/20-300|iL IX NT buffer in 50% glycerol is preferred. Varying amounts of 1- 
IOjxL of this stock can be used, for instance, in a nick translation reaction of 

20 approximately 125jjL. Most preferably, the DNasel is presented in the form of a 

DNasel stock which is 3pL of DNasel (Stratagene 100,000 U/mL) diluted with 169yL 
of IX NT buffer in 50% glycerol. As one of ordinary skill in the art will appreciate, 
however, this is all dependent on the specific reaction being carried out. The goal is 
to nick the heteroduplex DNA in an amount sufficient for the polymerase to enter and 

25 mend the DNA mismatches. Caution must be taken, however, not to degrade the 

DNA. Optimization parameters are readily understood and mastered in the art and are 
further discussed in e.g., Current Protocols in Molecular Biology (Ausubel et al, eds., 
1997). 

It is important to note that similar end results to that described above 
30 can be achieved with nicked heteroduplex DNA (instead of a nuclease and 

heteroduplex DNA) and a polymerase. The nicks present in the heteroduplex DNA in 
this instance accommodate the absence of the nuclease. 

Preferred embodiments of the above invention, further, employ DNA 
ligase in order to seal nicks present in the heteroduplex molecule. This can be 
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effected by treatment with DNA ligase under standard ligating conditions. In 
particularly preferred embodiments, the DNA ligase employed is T4 DNA ligase. 
Ligase can be added prior to, during or after the nick translation reaction. 

The reaction conditions (e.g., the volume of reaction, concentration of 
5 the 4 dNTPs, and amount of DNA) can be varied as will be appreciated by one of 
ordinary skill in the art; see, e.g., Current Protocols in Molecular Biology, supra. For 
example, one of ordinary skill in the art is aware that a nick translation reaction can be 
carried out with as little as 20 ng of DNA and can be scaled down to volumes as small 
as 5jiL; Id. Moreover, concentrations of dNTPs as low as 2jxM is sufficient for E. coli 
10 DNA Polymerase I, although, the polymerase is more efficient when supplied with 
higher concentrations of substrates; Id.. 

The temperature of the reaction, as one of ordinary skill in the art will 
appreciate, is adjusted according to the particular reaction substrates, enzymes, and 
conditions employed. 

15 Upon completion of the nick translation reaction, the reaction products 

are purified, digested with unique restriction enzymes and inserted into an appropriate 
plasmid vector. The plasmid vector with insert is then inserted into a suitable host for 
expression. The host cells are then screened to identify clones containing the desired 
mutations. 

20 One of ordinary skill in the art is aware that any of a variety of 

expression vectors can be used to effect expression of the recombinant 
polynucleotide. Particularly preferred expression vectors include pUC18 and its 
derivatives, pUC19 and its derivatives, pBR322 and its derivatives, the pBluescript 
series, the pGEM series (PromegaTM), pET series (PromegaTM), and pESP-1 

25 (StratageneTM). Generally, the specific choice of vector will depend upon the cell 
type used, the level of expression desired, and the like. 

Host cells may be prokaryotic or eukaryotic, including but not limited 
to, bacteria such as E. coli, fungal cells such as yeast, mammalian cells including, but 
not limited to, cell lines of human, bovine, porcine, monkey and rodent origin, and 

30 insect cells including but not limited to Drosophila and silkworm derived cell lines. 
Cells and cell lines of particular interest are derived from E. coli K12, £. coli B, 
Bacillus, and Streptomyces. 

Preferably, the screening and selection process is mediated by various 
markers known in the art, e.g., through luciferase, p-galactosidase, and green 
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fluorescent protein, although any method which would detect a novel activity, 
function or quality of the recombinant polynucleotide is suitable. Any screening and 
selection are designed according to what is specifically sought in the recombinant 
polynucleotide. For instance, an enzyme possessing a particular function or level of 
5 function can be tested by measuring or identifying the actual function (or an indicator 
thereof) of the recombinant protein. One of skill in the art can accommodate a search 
for any of a range of features into the particular screening. For instance, nucleotide 
sequences capable of binding to a specific protein can be sought with the labeled 
protein, or a specific characteristic or quality of the expressed product can be tested 

10 according to the features of the quality sought. The use of the instant invention for 
generating recombinant sequences with enhanced features relative to the parental 
sequences employed is the most significant use described herein. 

A further embodiment wherein the methods of the instant invention 
can he used is for enhancing directed evolution by PCR mutagenesis and other 

15 processes (e.g., error-prone PCR mutagenesis) designed to accrue sequences 
possessing variations with respect to a particular target sequence. For instance, 
mutants bearing different beneficial point mutations discovered in one round of 
directed evolution can be shuffled to generate a library of sequences where one or 
more of the library members contain collections of mutations onto single genes. 

20 These mutants are identified by screening for the desired beneficial activity. This cuts 
down on both the time and effort required to, one, collect mutations onto one gene 
using sequence information and site-specific PCR and, two, to gather multiple 
mutations by performing additional rounds of PCR mutagenesis and screening. 

This enhancement is also applicable to random methods of generating 

25 novel polynucleotide sequences. 

The following non-limiting Examples are presented to better illustrate 

the invention. 

30 EXAMPLE 1 

Preparation of PCR Products as Substrates for Heteroduplex DNA Reaction 

Two PCR reactions were performed to amplify both the Ml and M2 
mutants of the Lac Za gene. The reactions were carried out in the presence of 
601.6jaL H 2 0; 80mL 2mM dNTPs; 80>iL 10X PC 2 Buffer (Ab PEPTEDESTM) 
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consisting of 50mM Tris-HCl pH 9.1, 16mM ammonium sulfate, 3.5mM MgCk, and 
150^g/mLBSA; 16pLpUC18 Fl (40fiM): (5 ' -TG AGCGTCGTTTTTGTG AT; SEQ 
ID NO:6); 16nL pUC18 Rl (40|jM): (5'- TAGGGGTTCCGCGCACATTT; SEQ ID 
NO:7); 2A\xL POL MIX consisting of 15:1 Klentaq (Ab PEPTIDESTM) : PFU 
5 Polymerase (STRATAGENETM); and 4^L Template DNA (PROMEGATM 
miniprep of pUC18 plasmid Ml or M2 (M1:SEQ ID NO:l; M2:SEQ ID NO:2) 
~200ng/(jL). A total volume of 800nL was divided into eight 0.6mL PCR tubes and 
run on a thermal cycler using the following program: 94°C for 2minutes; 25 cycles of 
[94°C for lminute; 52°C for lminute; 72°C for lminute, lOseconds], 72°C for 

10 7minutes; and 6°C until purified. 

PCR Products were purified using PROMEGATM PCR Preps Kit with 
Wizard Minicolumns. 200pL of PCR reaction was purified per minicolumn and the 
product was eluted with 50jiL of H 2 0. A total volume of 200jiL was generated for 
both Ml and M2 pUCIS F1R1 PCR products. 

15 A 2 6o of PCR products was measured using a Spectra Max Plus 

spectrophotometer (MOLECULAR DEVICESTM) to determine concentration of 
DNA. The results were as follows: Ml pUC18 F1R1 =175 ng/pL; and M2 pUC18 
F1R1 =208.75 ng/jiL. 

20 EXAMPLE 2 

Heteroduplex DNA Formation 

170 ^L of Ml P UC18F1R1; 143 ML of M2 P UC18F1R1; and 16.47 |nL 
of 20X TrisKCl heteroduplex buffer (0.3 M tris pH 7.83, 1.2 M KC1) were mixed and 
the contents divided into 3X 0.6 mL PCR tubes. The tubes were then set on a thermal 
25 cycler for the following heteroduplex program: 

94 °C for 4 minutes; ramp to 90 °C and hold for 10 minutes; ramp to 85 °C and 
hold for 10 minutes; ramp to 80 °C and hold for 10 minutes; continue this 
ramp down in temperature at 5 °C increments and hold 10 minutes each until 
60 °C is reached and then ramp to 40 °C and hold 10 minutes; then 4°C until 
30 reaction is purified. 

Note that all temperature ramps are performed at 0.1 °C/sec. 
Heteroduplex DNA was purified using three Promega Wizard columns with the PCR 
prep Kit, and DNA was eluted in H2O for a combined volume of 150 nL of 

heteroduplex DNA. 
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EXAMPLE 3 

Nick Translation Reaction 

The following reagents were mixed on ice: 40 jjL of pUC 18 FIR 1 
5 purified heteroduplex DNA; 40 pL of H2O; 12.5 \\L of 10X NT buffer (0.5M TrisHCl 
pH 7.5; 0.1M MgCl2; 10mM Dithiothreitol (DTT); 0.5 mg/mL BSA); 5|uL of DNA 
Pol I (New England Biolabs); 22.5 |xL of 2 mM dNTPs; and 5pL of IX DNasel stock 
(dilute 3nL of DNasel (Stratagene, 100,000 U/mL) with 169yL of IX NT buffer in 
50% glycerol). 

10 The nick translation reaction was run at 14°C for 15 minutes. The 

reaction was stopped with addition of 2 |uL of 0.5 M EDTA pH 8.0. The nick 
translation reaction was purified using Promega PGR preps kit and DNA was eluted 
with 60|iLofH20. 

15 EXAMPLE 4 

Restriction Digest of Nick Translation DNA and Heteroduplex Control 

The following reagents were combined: 52juL of nick translation DNA; 
7 ML of NEB 4 buffer (NEB; New England BiolabsTM); o.7^L 10X BSA (NEB); l^L 
of EcoRI (NEB); 4\xL of Eco 01091 (NEB); 4yL Sap I (NEB); and 2nL of H2O. 

20 A heteroduplex control was prepared by combining the following 

reagents: 25\xL of pUC 18 F1R1 heteroduplex DNA (same stock as used in nick 
translation reaction); 7|wL of NEB 4 buffer (NEB); 0.7|iL 10X BSA (NEB); l^L of 
EcoRI (NEB); 4yL of Eco 01091 (NEB); 4^L Sap I (NEB); and 28 \iL of H2O. 

The rest of the reactions were carried out with both the nick translation 
25 DNA and the heteroduplex control separately. 

Restriction reactions were run at 37°C for 5 hours. 

Nick translation and control Lac Za insert bands were isolated from 
0.8% agarose gel using a razor blade. The gel fragment was purified with Promega 
PCR Preps kit and the inserts were eluted with 50\xL of H2O. 

30 

EXAMPLE 5 

Ligation Reaction 

The following reagents were combined: 0.5^L pUC18 vector DNA 
with Lac Za insert removed with EcoRI and Eco 01091; 5|uL of F1R1 control insert 
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DNA or 3\xL of nick translation insert DNA; 1\jlL of 10X Ligation Buffer (Boerhinger 
Manneheim); 0.75|xL of T4 DNA Ligase (Boerhinger Manneheim); and 2.75fiL of 
H2O for control or 4.75^L of H2O for nick translation control. 

Ligation reactions were run at 16°C overnight. 

5 

EXAMPLE 6 

Transformation Reaction 

XL-1 Blue competent cells (Stratagene) were defrosted and 50fiL was 
aliquotted per 0.6 mL PCR tube. 5|xL of ligation reaction was added to each tube and 

10 0.75pL of pUC 18 (Stratagene) plasmid was added as a control. The DNA was 

incubated with E. coli on ice for 30 minutes. The cell/DNA mixture was then put on 
a PCR block for 1 minute, 30 seconds, at 42°C. The heat shocked cell/DNA mixture 
was then immediately placed on ice for 2 minutes. 450^iL of S.O.C. (Gibco 
BRL)/tube was then added and the tube contents were then transferred to 15mL tubes 

15 (Falcon) and shaken at 37°C for 1 hour. Two 250 |iL aliquots of each transformation 
and one 30 nL aliquot of the wildtype pUC 18 control were then plated onto LB Agar 
Ampicillin and X-Gal and placed at 37°C overnight. 

EXAMPLE 7 

20 Results 

The number of colonies on half the plate were counted and multiplied 
by two to approximate the number of colonies per plate. The total number of blue 
colonies on each plate were counted. For the pUC 18 control plate, all colonies were 
counted. The results were as follows: the nick translation reaction resulted in 5 blue 
25 colonies out of 524 total colonies, for a 0.95% reversion frequency; the heteroduplex 
control resulted in 7 blue colonies out of 1,032 total colonies, for a 0.54% reversion 
frequency; and the pUC 18 control resulted in 45 blue colonies out of 47 total 
colonies, or 95.74% blue colonies. 

30 EXAMPLE 8 

Nick translation on two pUC 18 mutants lacking any insertions or deletions of bases 
relative to the wildtype Lac Za genome. 

Two pUC 18 mutants of Lac Z alpha M3 and M4 (M3:SEQ ID NO: 3; 
M4; SEQ ID NO:4) were constructed without any insertion or deletion of bases 

17 



WO 02/24953 



PCT/US01/29030 



relative to the wildtype genome. Heteroduplex DNA was generated in accordance 
with the method described in Arnold et al, WO 99/29902 (see also, Westmoreland et 
al y 1997 Gene/icsl45:29-38; which are both hereby incorporated by reference. 

Briefly, one mutant plasmid was digested with a unique restriction 
5 enzyme (EcoRI) upstream of the gene and the other mutant was digested with a 
different unique restriction enzyme (Eco0109I) downstream of the gene. The linear 
fragments were combined, heated to 94°C to denature the DNA strands and cooled 
back to room temperature. Heteroduplex DNA (one strand nicked with EcoRI, and 
the second strand nicked by Eco0109I) circularized and homoduplex DNA (both 
10 strands cut with either EcoRI or Eco0109I) remained linear. In this manner, 

heteroduplex easily distinguishable from the homoduplex DNA can be physically 
separated from the linear homoduplex by gel electrophoresis and by cutting out the 
heteroduplex band from the agarose gel. The heteroduplex DNA can then be purified 

iiQirifT Prnm^rroTM PPP Prorte Vit 
— * o" - - "f" — - 

15 The above experiments were carried out to generate heteroduplex 

plasmid DNA. Some of this heteroduplex plasmid DNA was contacted with nick 
translation enzymes as in Example 3 above. A portion of the heteroduplex plasmid 
was not contacted with nick translation enzymes such that it could be used as the 
heteroduplex negative control. The two plasmid preparations were transformed 

20 separately into both XL-1 Blue and XL mutS (a strain with the mutS gene knocked 
out). The results in the mutS cells were slightly better than in XL-1 blue cells 
suggesting that the mutHLS system is not involved in the recombination. In these 
experiments the total number of blue and white colonies were counted in a defined 
area on the agar plates. 

25 The results were as follows: in XL-1 blue cells , the nick translation 

reaction resulted in 125 blue colonies out of 1,03 1 total colonies, for a 12.12% 
reversion frequency; while the heteroduplex control resulted in 82 blue colonies out 
of 1,315 total colonies, for a 6.24% reversion frequency; in XL mutS cells , the nick 
translation reaction resulted in 55 blue colonies out of 385 total colonies, for a 

30 14.29% reversion frequency; while the heteroduplex control resulted in 38 blue 
colonies out of 510 total colonies, for a 7.45% reversion frequency. 

EXAMPLE 9 

Nick translation (and treatment with Pol I only) on M3 and M4 mutants 
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Two pUC 18 mutants of Lac Z alpha M3 and M4 (M3:SEQ ID NO: 3; 
M4:SEQ ID NO:4) were constructed without any insertion or deletion of bases 
relative to the wildtype genome. Heteroduplex DNA was generated in accordance 
with the method described in Arnold et aL, WO 99/29902; which is hereby 
5 incorporated by reference. 

Briefly, one mutant plasmid was digested with a unique restriction 
enzyme upstream of the gene and the other mutant was digested with a different 
unique restriction enzyme downstream of the gene. The linear fragments were 
combined, heated to 94°C to denature the DNA strands and cooled back to room 

10 temperature. Heteroduplex DNA circularized and homoduplex DNA remained linear. 
In this manner, heteroduplex easily distinguishable from the homoduplex DNA can be 
physically separated from the linear homoduplex by gel electrophoresis and by cutting 
out the heteroduplex band from the agarose gel. The heteroduplex DNA can then be 
purified using PromegaTM PGR Preps kit. 

15 The above experiments were carried out with both the heteroduplex 

negative control, the heteroduplex treated with Pol I only, and the heteroduplex 
plasmid treated with the nick translation enzymes (both Pol I and DNase I). Then the 
plasmid preparations were transformed into XL-1 Blue. In this experiment, you will 
note that the heteroduplex DNA treated with Pol I only shows an improved frequency 

20 of recombination over that of the heteroduplex control. The nick translation-treated 
heteroduplex molecules still fare the best. In these experiments the total number of 
blue and white colonies were counted in a defined area on the agar plates. 

The results were as follows: the nick translation reaction resulted in 46 
blue colonies out of 390 total colonies, for a 1 1.79% reversion frequency; the Pol I- 

25 only treated heteroduplex resulted in 78 blue colonies out of 829, for a 9.41% 

reversion frequency; and the heteroduplex control resulted in 46 blue colonies out of 
626 total colonies, for a 7.35% reversion frequency. 
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WHAT IS CLAIMED: 

1. A method for producing recombinant polynucleotides, which comprises: 
(a) providing variant homologous parental sequences; 
5 (b) incubating said sequences under conditions which promote heteroduplex 

formation; and 

(c) contacting said heteroduplexes with a mixture comprising a heteroduplex 
repair system consisting essentially of a nuclease and a polymerase. 

10 2. A method in accordance with claim 1 further comprising: 

(d) identifying the recombinant polynucleotides produced. 

3. A method in accordance with claim 1 wherein the heteroduplexes are 
contacted with a mixture comprising a heteroduplex repair system consisting of a 

1 5 nuclease and a polymerase. 

4. A method in accordance with claim 1 wherein the nuclease is an 
endonuclease. 

20 5. A method in accordance with claim 4 wherein the endonuclease is DNase I, 

6. A method in accordance with claim 1 wherein the nuclease is a restriction 
enzyme. 

25 7. A method in accordance with claim 1 wherein the nuclease is an exonuclease. 

8. A method in accordance with claim 1 wherein the polymerase is DNA 
polymerase I. 

30 9. A method in accordance with claim 1 wherein the nuclease is DNase I and the 
polymerase is DNA polymerase I. 

10. A method in accordance with claim 1 wherein the nuclease is a restriction 
enzyme and the polymerase is DNA polymerase I 
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11. A method in accordance with claim 1 wherein the homologous sequences are 
derived from two or more different genes. 

5 12. A method in accordance with claim 1 wherein the homologous sequences are 
derived from the same gene. 

13. A method in accordance with claim 12 wherein at least one of the homologous 
sequences is a mutant of Lac Za. 

10 

14. A method in accordance with claim 1 wherein the homologous sequences are 
on plasmids. 

15. A method in accordance with claim 14 wherein the method further comprises 
1 5 linearizing the plasmids prior to step (b). 

16. A method in accordance with claim 1 wherein the mixture further comprises 
DNA ligase. 

20 17. A method in accordance with claim 16 wherein the ligase is T4 DNA ligase. 

18. A method in accordance with claim 1 wherein the homologous parental 
sequences are DNA. 

25 19. A method in accordance with claim 1 wherein heteroduplex formation is 
achieved by heating and annealing of the homologous parental sequences. 

20. A method in accordance with claim 1 wherein heteroduplex formation is 
achieved via single strand PCR of complementary strands of the homologous parental 

30 sequences which are subsequently annealed. 

21. A method in accordance with claim 20 wherein the single strand PCR is 
carried out with more than one primer. 
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22. A method in accordance with claim 21 wherein the PCR is carried out with 
primers comprising a forward primer for one strand of a parental sequence and a 
reverse primer for the complementary sequence on another parental sequence. 

5 23. A method in accordance with claim 1 wherein heteroduplex formation 
involves amplifying the homologous parental sequences. 

24. A method in accordance with claim 23 wherein heteroduplex formation 
involves amplifying single stranded plasmid DNA homologous parental sequences. 

10 

25. A method in accordance with claim 23 wherein the amplification is carried out 
by M13-derived vectors and helper phage. 

26. A method in accordance with claim 1 wherein step (d) comprises 
15 (a) purifying a polynucleotide product of step (c); 

(b) digesting said polynucleotide product for insertion into a vector; 

(c) ligating the digested polynucleotide product into a vector; 

(d) expressing said vector in a suitable host; and 

(e) identifying polynucleotide clones exhibiting a novel characteristic or 
20 function with respect to the parental sequences; said novel polynucleotide clones 

comprising the recombinant polynucleotides. 

27. A method in accordance with claim 26 wherein the method further comprises 
isolating the recombinant polynucleotides from the novel polynucleotide clones. 

25 

28. A method for producing recombinant polynucleotides, which comprises: 

(a) providing variant homologous parental sequences; 

(b) incubating said sequences under conditions which promote heteroduplex 
formation wherein the resultant heteroduplexes are nicked; and 

30 (c) contacting said heteroduplexes with a mixture comprising a heteroduplex 

repair system consisting essentially of a polymerase. 

29. A method in accordance with claim 28 further comprising: 
(d) identifying the recombinant polynucleotides produced. 
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30. A method in accordance with claim 28 wherein the mixture consists of a 
polymerase. 

5 31. A method in accordance with claim 28 wherein the polymerase is DNA 
polymerase I. 

32. A method in accordance with claim 28 wherein the homologous parental 
sequences are DNA. 

10 

33. A method in accordance with claim 28 wherein step (d) comprises 

(a) purifying a polynucleotide product of step (c); 

(b) digesting said polynucleotide product for insertion into a vector; 

(c) lighting the digested polynucleotide product into a vector; 
15 (d) expressing said vector in a suitable host; and 

(e) identifying polynucleotide clones exhibiting a novel characteristic or 
function with respect to the parental sequences; said novel polynucleotide clones 
comprising the recombinant polynucleotides. 

20 34. A method in accordance with claim 33 wherein the method further comprises 
isolating the recombinant polynucleotides from the novel polynucleotide clones. 

35. A method of repairing mismatched nucleic acid molecules in vitro which 
comprises contacting the mismatched molecule with a mixture consisting essentially 

25 of a polymerase and a nuclease, 

36. A method in accordance with claim 35 wherein the mixtures consists of a 
polymerase and a nuclease. 

30 37. A method in accordance with claim 35 wherein the nuclease is an 
endonuclease. 

38. A method in accordance with claim 37 wherein the nuclease is DNase I. 



23 



WO 02/24953 PCT/USO 1/29030 



39. A method in accordance with claim 37 wherein the nuclease is a restriction 
enzyme. 

40. A method in accordance with claim 35 wherein the nuclease is an 
5 exonuclease. 

41. A method in accordance with claim 35 wherein the polymerase is DNA 
polymerase I. 

10 42. A method in accordance with claim 35 wherein the nuclease is DNase I and 
the polymerase is DNA polymerase I. 

43. A method in accordance with claim 35 wherein the nuclease is a restriction 
enzyme and the polymerase is DNA polymerase T. 

15 

44. A method in accordance with claim 35 wherein the homologous parental 
sequences are DNA. 

45. A method of repairing mismatched nucleic acid molecules in vitro wherein the 
20 mismatched molecule is nicked which comprises contacting the mismatched 

heteroduplexes with a mixture consisting essentially of a polymerase. 

46. A method in accordance with claim 45 wherein the mixture consists of a 
polymerase. 

25 

47. A method in accordance with claim 45 wherein the polymerase is DNA 
polymerase I. 

48. A method in accordance with claim 45 wherein the homologous parental 
30 sequences are DNA. 
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Section l 

(1) 1 10 20 30 40 I— I n56 

M2 Contio 2E ( 1) ATGATTACGAATTCGAGCTCGGTACCCGGGGATCCTCTAGAGTCGACCTGCAGGCA 
WT 3Acontlg (1) ATGATTACGAATTCGAGCTCGGTACCCGGGGATCCTCTAGAGTCGACCTGCAGGCA 
Ml Contio 1ft (1) ATGATTACGAATTCGAGCTCGGTACCCGGGGATCCTCTAGAGTCGACCTCGAGCCA 
Consensus < 1) ATGATTACGAATTCGAGCTCGGTACCCGGGGATCCTCTAGAGTCGACCTGCAGGCA 

— -Section 2 

(57) 57 n I — ill 1 80 90 100 112 

Contio 2E (57) TGCAAGCTTGGCACTGGCCGTCGTTTTACAACGTCGTGACTGGGAAAACCCTGGCG 
3Acontig ( 57) TGCAAGCTTGGCACTGGCCGTCGTTTTACAACGTCGTGACTGGGAAAACCCTGGCG 
Contio 1A (57) TG — GCTAACTAATTAAGTAATTTTTACAACGTCGTGACTGGGAAAACCCTGGCG 
Consensus ( 57) TGCAAGCTTGGCACTGGCCGTCGTTTTACAACGTCGiGACTGGGAAAACCCiGGCG 

Sectlm 3 

( 113) 113 120 130 140 150 nn nn 1168 
Contio 2E (113) TTACCCAACTTAATCGCCTTGCAGCACATCCCCCTTTCGCCAGTTAACTAATTAAC 
3Acontlg ( 113) TTACCCAACTTAATCGCCTTGCAGCACATCCCCCTTTCGCCAGCTGGCGTAATAGC 
Contio 1A (110) TTACCCAACTTAATCGCCTTGCAGCACATCCCCCTTTCGCCAGCTGGCGTAATAGC 
Consensus ( 113) TTACCCAACTTAATCGCCTTGCAGCACATCCCCCTTTCGCCAGCTGGCGTAATAGC 

_ — Section 4 

( I69) lft9 I! 180 190 200 210 224 

Contio 2E (169) TAAGATATCGCCCGCACCGATCGCCCTTCCCAACAGTTGCGCAGCCTGAATGGCGA 
3Acontig ( 169) GAAGAG — GCCCGCACCGATCGCCCTTCCCAACAGTTGCGCAGCCTGAATGGCGA 
Contio 1A (166) GAAGAG — GCCCGCACCGATCGCCCTTCCCAACAGTTGCGCAGCCTGAATGGCGA 
Consensus ( 169) GAAGAG — GCCCGCACCGATCGCCCTTCCCAACAGTTGCGCAGCCTGAATGGCGA 

Section 5 

(225) 225 230 240 250 260 270 280 

Contio 2£ (225) ATGGCGCCTGATGCGGTATTTTCTCCTTACGCATCTGTGCGGTATTTCACACCGCA 
3Acontio ( 222) ATGGCGCCTGATGCGGTATTTTCTCCTTACGCATCTGTGCGGT ATTTCACACCGC A 
Contio 1A (219) ATGGCGCCTGATGCGGTATTTTCTCCTTACGCATCTGTGCGGTATTTCACACCGCA 
Consensus ( 225) ATGGCGCCTGATGCGGTATTTTCTCCTTACGCATCTGTGCGGTATTTCACACCGCA 

Section 6 

(281) 281 290 300 310 321 
Contio 2E (281) TATGGTGCACTCTCAGTACAATCTGCTCTGATGCCGCATAG SEQ ID N0» 2 
3Acontlg (278) TATGGTGCACTCTCAGTACAATCTGCTCTGATGCCGCATAG SEQ ID 10 5 
Contig 1A (275) TATGGTGCACTCTCAGTACAATCTGCTCTGATGCCGCATAG SEQ ID It 1 
Consensus ( 281) TATGGTGCACTCTCAGTACAATCTGCTCTGATGCCGCATAG SEQ ID NB8 



n above changes in sequence 

FIG. 1. 
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From left, 5uL 1KB Ladder; 2uL digested ssDNA heteroduplex control, 2uL 
digested dsDNA heteroduplex control; 2uL digested ssDNA heteroduplex after nick 
translation; 2uL digested dsDNA heteroduplex after nick translation with ligase; 2uL 
digested dsDNA heteroduplex after nick translation; 3.5uL Lac Zl insert DNA; 3.5uL 
ssDNA heteroduplex after nick translation; 3.5uL dsDNA heteroduplex after nick 
translation with ligase; 3.5uL dsDNA heteroduplex after nick translation. 

FIG. 2. 
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Section 1 

(1) 1 10 20 30 40 56 

M3 FulllALl (1) ATGATTACGAATTCGAGCTCGGTACCCGGGGATCCTCTAGAGTCGACCTGCAGGCA 
WT 3Acontig (1) ATGATTACGAATTCGAGCTCGGTACCCGGGGATCCTCTAGAGTCGACCTGCAGGCA 
M4 ful 12q7 (1) ATGATTACGAATTCGAGCTCGGTACCCGGGGATCCTCTAGAGTCGACCTGCAGGCA 
Consensus ( 1) ATGATTACGAATTCGAGCTCGGTACCCGGGGATCCTCTAGAGTCGACCTGCAGGCA 

_ Sectim I 

(57) 57 I — lUl 180 90 100 112 

Ful 1 1AL1 (57) TGCAAGCTAACTAATTAAGTAAGTTTTACAACGTCGTGACTGGGAAAACCCTGGCG 
3Acont I o (57) TGCAAGCTTGGCACTGGCCGTCGTTTTACAACGTCGTGACTGGGAAAACCCTGGCG 
ful I2a7 (57) TGCAAGCTTGGCACTGGCCGTCGTTTTACAACGTCGTGACTGGGAAAACCCTGGCG 

r> /c->\ Trr*A hrrrrrrr »PTrrn(VTnrTTTT»r» Krrrrrrr APTrrrA a a hrrnrrrr 

ujf beiibUb kjii I ulhhuu I I uui/HU I uuuou i tu ii i i nunnou i uu i unu i uuunnnnoou i uuou 

SectliiEL 3 

(113) 113 120 130 140 150 nnn 168 
FulllALl ( 1 13) TTACCCAACTTAATCGCCTTGCAGCACATCCCCCTTTCGCCAGCTGGCGTAATAGC 
3Acorrtlg (113) TT ACCCAACTT AATCGCCTTGCAGCAC ATCCCCCTTTCGCCAGCTGGCGTAAT AGC 

ful 12q7 (113) TTACCCAACTTAATCGCCTTGCAGCACATCCCCCTTTCGCCAGCTAACTAATTAAC 
Consensus (113) TTACCCAACTTAATCGCCTTGCAGCACATCCCCCTTTCGCCAGCTGGCGTAATAGC 

=— = Section 4 

( 169) 1169 II 180 190 200 210 224 

Ful 1 1AL1 (169) GAAGAGGCCCGCACCGATCGCCCTTCCCAACAGTTGCGCAGCCTGAATGGCGAATG 
3Acontlg ( 169) GAAGAGGCCCGCACCGATCGCCCTTCCCAACAGTTGCGCAGCCTGAATGGCGAATG 

ful 12o7 ( 169) TAAGTGGCCCGCACCGATCGCCCTTCCCAACAGTTGCGCAGCCTGAATGGCGAATG 
Consensus ( 169) GAAGAGGCCCGCACCGATCGCCCTTCCCAACAGTTGCGCAGCCTGAATGGCGAATG 

Section 5 

(225) 225 230 240 250 260 270 280 
Ful 1 1AL1 (225) GCGCCTGATGCGGTATTTTCTCCTTACGCATCTGTGCGGTATTTCACACCGCATAT 
3Acont i o (225) GCGCCTGATGCGGTATTTTCTCCTTACGCATCTGTGCGGTATTTCACACCGCATAT 

ful I2a7 (225) GCGCCTGATGCGGTATTTTCTCCTTACGCATCTGTGCGGTATTTCACACCGCATAT 
Consensus (225)GCGCCTGATGCGGTATTTTCTCCTTACGCATCTGTGCGGTATTTCACACCGCATAT 

Section 6 

(PRO 281 290 300 318. 

Ful 1 1AL1 (281) GGTGCACTCTCAGTACAATCTGCTCTGATGCCGCATAG SEQ ID Ntt 3 
3Acontlg (278) GGTGCACTCTCAGTACAATCTGCTCTGATGCCGCATAG SEQ ID Nft 5 

ful I2a7 (281) GGTGCACTCTCAGTACAATCTGCTCTGATGCCGCATAG SEQ ID Ntt 4 
Consensus (281) GGTGCACTCTCAGTACAATCTGCTCTGATGCCGCATAG SEQ ID Ntt 9 

I I above changes in sequence 

FIG. 3. 
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Parent A Parent B 

Restriction Restriction Restriction Restriction 

Site 1 Site 2 Site 1 Site 2 



E23 ESafEHS fht^i i-i^i 



S3 
^ -Hi 



mutation in A 



< n m 1 1 1 1 n 1 1 1 1 1 m 1 1 i ^gft inn 



mutation in B 



=i top strand 
h bottom strand 



Heat to melt into single strands, 
Cool to anneal random pairs 
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h- 



1 1 1 1 1 1 1 1 n 1 1 1 1 n 1 1 1 1 j j Ti u 1 1 1 1 
homoduplexes 



ES2^ES3 

heteroduplexes 



0 



Optional step: isolate heteroduplexes 



i ii i inn i n m i i i hi^ i m ii 



E23 E2^E5E5I EES E 7 ?^ 

heteroduplexes 



0 



FIG. 8A. 
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Working with just one heteroduplex 



Perform nicking reaction to 
create gaps in the sequence 




Add polymerase to digest locally 
at nicks and fill in with copy of 
opposite strand 



One of four homoduplex outcomes 



E^ m i ll ll ll l ll l lll ll 



Vector DNA 



Cut with restriction enzymes at cut 
sites 1 and 2, add vector and ligase, 
and clone into plasmld vector 



E^^^g^ ^ 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 ^^^ ^ ^^^^^ a+w ^^^^sa 

^ESteM^ 1 1 1 1 1 1 1 1 1 1 1 1 ! I ! 1 1 1 1 1 1 1 111 ll ES^J^rt^ 1 1 1 1 1 M I fei^^^*^ 



FIG. 8B. 
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SEQUENCE LISTING 

<110> Merck & Co., Inc. 

<120> METHOD FOR GENERATING RECOMBINANT 
POLYNUCLEOTIDES 

<130> 20746 PCT 

<140> 60/234,439 
<141> 2000-09-21 

<160> 9 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 315 
<212> DNA 

<213> Escherichia coli 
<400> 1 

atgattacga attcgagctc ggtacccggg gatcctctag agtcgacctc gagccatggc 60 
taactaatta agtaattttt acaacgtcgt gactgggaaa accctggcgt tacccaactt 120 
aatcgccttg cagcacatcc ccctttcgcc agctggcgta atagcgaaga ggcccgcacc 180 
gatcgccctt cccaacagtt gcgcagcctg aatggcgaat ggcgcctgat gcggtatttt 240 
ctccttacgc atctgtgcgg tatttcacac cgcatatggt gcactctcag tacaatctgc 300 
tctgatgccg catag 315 

<210> 2 
<211> 321 
<212> DNA 

<213> Escherichia coli 
<400> 2 

atgattacga attcgagctc ggtacccggg gatcctctag agtcgacctg caggcatgca 60 
agcttggcac tggccgtcgt tttacaacgt cgtgactggg aaaaccctgg cgttacccaa 120 
cttaatcgcc ttgcagcaca tccccctttc gccagttaac taattaacta agatatcgcc 180 
cgcaccgatc gcccttccca acagttgcgc agcctgaatg gcgaatggcg cctgatgcgg 240 
tattttctcc ttacgcatct gtgcggtatt tcacaccgca tatggtgcac tctcagtaca 300 
atctgctctg atgccgcata g 321 

<210> 3 
<211> 318 
<212> DNA 

<213> Escherichia coli 
<400> 3 

atgattacga attcgagctc ggtacccggg gatcctctag agtcgacctg caggcatgca 60 
agctaactaa ttaagtaagt tttacaacgt cgtgactggg aaaaccctgg cgttacccaa 120 
cttaatcgcc ttgcagcaca tccccctttc gccagctggc gtaatagcga agaggcccgc 180 
accgatcgcc cttcccaaca gttgcgcagc ctgaatggcg aatggcgcct gatgcggtat 240 
tttctcctta cgcatctgtg cggtatttca caccgcatat ggtgcactct cagtacaatc 300 
tgctctgatg ccgcatag 318 
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<210> 4 
<211> 318 
<212> DNA 

<213> Escherichia coli 
<400> 4 

atgattacga attcgagctc ggtacccggg gatcctctag agtcgacctg caggcafcgca 60 
agcttggcac tggccgtcgt tttacaacgt cgtgactggg aaaaccctgg cgttacccaa 120 
cttaatcgcc ttgcagcaca tccccctttc gccagctaac taattaacta agtggcccgc 180 
accgatcgcc cttcccaaca gttgcgcagc ctgaatggcg aatggcgcct gatgcggtat 240 
tttctcctta cgcatctgtg cggtatttca caccgcatat ggtgcactct cagtacaatc 300 
tgctctgatg ccgcatag 318 

<210> 5 
<211> 318 
<212> DNA 

<213> Escherichia coli 



<210> 6 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 6 

tgagcgtcgt ttttgtgat 19 

<210> 7 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 



<400> 5 



atgattacga attcgagctc ggtacccggg gatcctctag agtcgacctg caggcatgca 

agcttggcac tggccgtcgt tttacaacgt cgtgactggg aaaaccctgg cgttacccaa 

cttaatcgcc ttgcagcaca tccccctttc gccagctggc gtaatagcga agaggcccgc 

accgatcgcc cttcccaaca gttgcgcagc ctgaatggcg aatggcgcct gatgcggtat 

tttctcctta cgcatctgtg cggtatttca caccgcatat ggtgcactct cagtacaatc 
tgctctgatg ccgcatag 



60 
120 
180 
240 
300 
318 



<400> 7 

taggggttcc gcgcacattt 



20 



<210> 8 

<211> 318 

<212> DNA 

<213> Artificial Sequence 



<220> 
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<223> Consensus sequence 



<400> 8 



atgattacga 
agcttggcac 
cttaatcgcc 
accgatcgcc 
tttctcctta 
tgctctgatg 



attcgagctc ggtacccggg gatcctctag agtcgacctg caggcatgca 

tggccgtcgt tttacaacgt cgtgactggg aaaaccctgg cgttacccaa 

ttgcagcaca tccccctttc gccagctggc gtaatagcga agaggcccgc 

cttcccaaca gttgcgcagc ctgaatggcg aatggcgcct gatgcggtat 

cgcatctgtg cggtatttca caccgcatat ggtgcactct cagtacaatc 
ccgcatag 



60 
120 
180 
240 
300 
318 



<210> 9 
<211> 318 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Consensus sequence 
<400> 9 

atgattacga attcgagctc ggtacccggg gatcctctag agtcgacctg caggcatgca 60 
agcttggcac tggccgtcgt tttacaacgt cgtgactggg aaaaccctgg cgttacccaa 120 
cttaatcgcc ttgcagcaca tccccctttc gccagctggc gtaatagcga agaggcccgc 180 
accgatcgcc cttcccaaca gttgcgcagc ctgaatggcg aatggcgcct gatgcggtat 240 
tttctcctta cgcatctgtg cggtatttca caccgcatat ggtgcactct cagtacaatc 300 
tgctctgatg ccgcatag 318 
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